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(2) INFORMATION FOR SEQ ID N0:1: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 





(xi) SEQUENCE 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



'I. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Itcgagttta CCACTCCCTA TCAGTGATAG AGAAAAGTGA AAGTCGAGTT TACCACTCCC 60 

ts^TCAGTGAT AGAGAAAAGT GAAAGTCGAG TTTACCACTC CCTATCAGTG ATAGAGAAAA 120 

yrCAT^GTCG AGTTTACCAC TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC 180 

MtTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG 24 0 

M3AAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT GAAAGTCGAG 300 

CTCGGTACCC GGGTCGAGTA GGCGTGTACG GTGGGAGGCC TATATAAGCA GAGCTCGTTT 360 

AGTGAACCGT CAGATCGCCT GGAGACGCCA TCCACGCTGT TTTGACCTCC ATAGAAGACA 420 

CCGGGACCGA TCCAGCCTCC GCGGCCCCGA ATTAGCTTGA TATCGAATTC 470 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGTACCACTA GTAAGCTTAG ATCCACTGTC TGGGATTATA TCAGGACAAC CGAAGCCTGG 60 
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AAAGTGTATT AGGTAGAGCA TTTTCTTCCA CGTGTTTGGG CACGTTTCCG ACAGCTAGGA 120 

TTCCAGCTCT GTCTTTGTAT GTTACAGACT GTAT^TCAAT CGCAGGTGAA ACTGTTTGGA 180 

CAGTAGGTGG GGATCAAAGA CCCTCCGCCC GTGAGACTCT AGGCGCTTTC CCCTGCCACC 240 

AGCCTGTCTC CAGAGATGCT CTGGAAGGAG GCGGGCCCGG GCGGTCTTTC TGCTCTTTAG 300 

CGTGGCGGAC GCGGCGGCGG GGGCAGGGCT GGAGCAGAGA GCGCTGCAGT GATAGAACTT 360 

TCTGACCCCG CTGCGCAGGG CGGCAGGGTG GCAGGGTGGC AGGGTGGCGA GCTAAGCCAG 420 

AGCCGAACGC TGGAGCTCTG GGAGGAACAT CGAAGGTTTG TATGTGGTCT GAGATCGGCC 480 

TGACTATATT TTTTTGTCCT AAATTTGCAA GCACACACCC ACAAAGCTGC GGTCTTGACC 540 

GGTATTCTTT ATAGAGCGCA ATGGAGTGAG CTGAGTGTCT AAACGATTTC CCTAATTCAT 600 

CTGATAGCAG AGGCGCTCTC CTAATTGGCG AAGAGCTGCC TCATGTCCGC AACTTTTTGG 660 

CAGAGTGAAT TCCACAGCTT TGTGTGTGTG TGTGGGGGGG GGTGTAAGGG GTGTCTAAAA 720 

CTTTCGGTCT CCTACTATTC TGTATCTCGA CCGGTTGGTT TTACACCCCG GCTCATCTCA 780 

TCAACGCAAA CACCCCCACT CTCCTATGGA CCCAAGGACC TGACGTGGGG GAAGGTGGAC 840 

ATTAGGAATG TCAGAAACCT AGAGTCCACG CTCCTCCTCT CCATCTTTCC ACGAGTTTGG 900 

GAAACTTCTT GGCTGCGAAG ACTTTGACCC ACATCTGCAT TTCTCAGCCC CAGCTTCCAA 960 

AAGTGCTGCA GGTTCGGGAG GGGAGACCTC AGTCCTCCTT TGTGAGGCTT GTTTGCGTTG 1020 

AGGGATTGGC AGCGATGGCT TCCAGATGGG CTGAAACCCT GCCCGTATTT ATTTAAACTG 1080 

GTTCCTCGTG GAGAGCTGTG AATCGGGCTC TGTATGCGCT CGAGAAAAGC CCCATTCATG 1140 

AGAGGCAAGG CCCAGTGGGT CCCCCCGACT CCCCGACCCC CCTCTCCCAC AATATATCCC 1200 

^eCCTCCCTGT GCCCGCCTGC CGCCACCTCC CGGGCTCCGG CCCCGCGCGC AGCGGCGACG 1260 

SaGCAACACA GTTCCCCGAA AGAGGTAGCT TTTTAATTGG CCAGCCACAA AGAATCACTT 1320 

ItGCCGCACG GCGGTAACGA GGGGAACCGG ATCGGGCGGC CAGGATGCTA TCTGTGTAGC 13 80 

CCTTTTCGTG CCACAATTAG GGTGGTGCTG GCTTCCTCCG ACCGCACCTA GGCGATCTGG 1440 

StACACTGTT GGCTCCTTTC TTGGGCAGTC ATTTAATCCT ACTTTTTACT CTACGAATGT 1500 

GTGTCTGATG GAGGGCTGTG TCCGGAGCCC CATCCACAAA GAGTCAGCCA GCAGCTCTCA 1560 

'GACCCGGCTG GATCTCATAT GGTGCACTCT CAGTACAATC TGCTCTGATG CCGCATAGTT 1620 

^2^GCCAGCCA AGCTAGCTTG CGCAAGCTAG CTTGCGATCC GTAAAAATGT GTGAGAGTTA 1680 

%AAATGTCT TCCGGGCTAA GATCCGACAG CCATGGTCCA AAGAAGACTT CGGCACTGCA 1740 

^GACTTAAAAC CAGCTTTCTA GCAGAGGCAG AAGGATCTAG AGCCAAAGGC AAAGACTTGA 1800 

k^AGGCTGGG AAGATGCAAG AATGGCATTT TACATAAAGA ACACTCTCTC CTTTTCCAGC 1860 

mGCACACTT GCATAGAAAT TAAGTTTTAC ACTTGAAGTT CTTTGTTTCC ATCCTGAGAA 1920 

SCTCCAAAGT CTGAGGTGGT GTGGTATGCT GGGTAATTCT CCCCACCCCC CAACATTCCC 1980 

SGGGGGTTCC ATGGGGGTAG CTTCTCCCAA GGACTTCCAG CGGCAACACA GAAATCCCAC 2040 

mrCGAGACAA AGGAGTTACT GCTTAAATCA GGCCCTAATT TCCAAGGTTC CCTTTGCTTA 2100 

iAGTTCCCTA GAGGACCATC TCACTTCTAA AGAAAAGGTG TATTCGGGGA CCCATCCTCA 2160 

ACCTCCTTGT TATGGAAGGA GACTTCGGGA ACAGAGCAAG GGCTGAGCCT CCGGCAGTTT 2220 

GGGGTAAGGT TGGGGTTGGG GGGAGCAAGG AAGGCAAGTG AGGCTGGAGG CCCAGGGATA 2280 

GGGGAAGATG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TCGGGGATGG 2340 

TGGTGGTGGA CAACTAGGAA ACTCTGGCGC TTTCTCCTCC CCTCACAAAA CTGAGTCCAG 2400 

CTGGAGCCGC CTCCAGACTC TCTGGCCAGG GCCTCAGAGT GGTCAACAGT CCCTGGCCAG 2460 

CGTTGCTCTC TCCAGGCTAA GGGCACCCAC TCCCCTGGAG ATTCCTGAAC CTGGGCCAGG 2520 

AAGAGCCGAA TTAGACAAGT GTCTCCAATC CGGCTGCGTG CGGATTTTGT TGCGGTGTCC 2580 

CTCGGTTGTC TGCAGTTCCT TTAGTCCCTT CCCTGGCCTG CCCCTTACAC CTCCACACAG 2640 

GTCCCCCTCT GTGTAGGAAT ACACCAGACC CTCTCTTAGC CACACACACC TCCAGTCCCC 2700 

CGTCTACCTA GATTTTTTTC ATAGCTAGTT GGATGGGGGA TGGGTTAGGG AGGCTGGGTT 2760 

TGCGAGCCTC CAGGTGGGAG TTCACCGACA GGTACTCCGC AAAGGAGCTG GAAGGCAGGT 2820 

CTGGAAAACT GTCCCCCAGA TTTAGGATTC TGGGCAGCTT CCATCAGCTT ATACTTTGGC 2880 

TCCCCCGCCC CCTAAACTCC CCATCCCCAC CTTCCTTTCT CCCGTTACTT CGTCCTCCCT 294 0 

CGCCTTTCCA GCCTTGAGTC TAAAGCTCCA TGCTTATGCC TCTGCAAACA ACCCCCTCCC 3000 

TTCTAACCCC AGCAGAACTC CGAGGAAAGG GGCCGGAGGC CCCCCTTCTC GCCTGTGGTT 3060 

AGAGGGGGCA GTGTGGCAGT CCCAAGTGGG GGCGACCGGA GGCCGTCTCG GTGCCCCGCC 3120 



r. 
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CGATCAGGCC ACTGGGCACA TCGGGGGCGG GAAGCTGGGC TCACCAAAGG GGCGACTGGC 3180 

CTTGGCAGGT GTGGGCTCTG GTCCGGCCTG GGCAGGCTCC GGGGGCGGGG TCTCAGGTTA 3240 

CAGCCCCGCG GGGGGCTGGG GGGCGGCCCG CGGTTTGGGC TGGTTTGCCA GCCTTTGGAG 3300 

CGACCGGGAG CATATAACCG GAGCCTCTGC TGGGAGAAGA CGCAGAGCGC CGCTGGGCTG 3360 

CCGGGTCTCC TGCCTCCTCC TCCTGCTCCT AGAGCCTCCT GCATGAGGGC GCGGTAGAGA 3420 

CCCGGACCCG CTCCGTGCTC TGCCGCCTCG CCGAGCTTCG CCCGCAAGCT GGGGAATTC 347 9 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

J>ro Arg Cys Gly Val Pro Asp Val 

0 1 5 

\| (2) INFORMATION FOR SEQ ID NO: 5: 

Sd (i) SEQUENCE CHARACTERISTICS: 

£ (A) LENGTH: 3 9 base pairs 

y^i (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
f (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

1 y 

Agccaagat GCGGGGTTGT CGATGTGGGT GAATACAAT 3 9 

p 

U (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAAAAAGCCA AGATGCGGGG GTCCTGATGT GGGTGAATAC 
(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGTACCACTA GTAAGCTTAG ATCTCATATG GTCGACCCCG GGGAATTCCT GCAGGGATCC 60 
TCTAGAAGTA CTCCATGGGT ATACATCGAT GCGGCCGC 98 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2792 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



a 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 



^Itcgagttta CCACTCCCTA TCAGTGATAG AGAAAAGTGA AAGTCGAGTT TACCACTCCC 60 

&IATCAGTGAT AGAGAAAAGT GAAAGTCGAG TTTACCACTC CCTATCAGTG ATAGAGAAAA 120 

ItGAAAGTCG AGTTTACCAC TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC 180 

IlCTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG 240 

SgAAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT GAAAGTCGAG 3 00 

'^dTCGGTACCC GGGTCGAGTA GGCGTGTACG GTGGGAGGCC TATATAAGCA GAGCTCGTTT 3 60 

'AGTGAACCGT CAGATCGCCT GGAGACGCCA TCCACGCTGT TTTGACCTCC ATAGAAGACA 420 

^tCGGGACCGA TCCAGCCTCC GCGGCCCCGA ATTAGCTTGA TATCGAATTC GAGCTCGGTA 4 80 

?eCCGGGGATC CTCTAGACAA GATGCATCCA GGGGTCCTGG CTGCCTTCCT CTTCTTGAGC 540 

ifGGACTCATT GTCGGGCCCT GCCCCTTCCC AGTGGTGGTG ATGAAGATGA TTTGTCTGAG 600 

SQAAGACCTCC AGTTTGCAGA GCGCTACCTG AGATCATACT ACCATCCTAC AAATCTCGCG 660 

SGAATCCTGA AGGAGAATGC AGCAAGCTCC ATGACTGAGA GGCTCCGAGA AATGCAGTCT 720 

giTCTTCGGCT TAGAGGTGAC TGGCAAACTT GACGATAACA CCTTAGATGT CATGAAAAAG 780 

CCAAGATGCG GGGTTGTCGA TGTGGGTGAA TACAATGTTT TCCCTCGAAC TCTTAAATGG 840 

TCCAAAATGA ATTT7VACCTA CAGAATTGTG AATTACACCC CTGATATGAC TCATTCTGAA 900 

GTCGAAAAGG CATTCAAAAA AGCCTTCAAA GTTTGGTCCG ATGTAACTCC TCTGAATTTT 960 

ACCAGACTTC ACGATGGCAT TGCTGACATC ATGATCTCTT TTGGAATTAA GGAGCATGGC 1020 

GACTTCTACC CATTTGATGG GCCCTCTGGC CTGCTGGCTC ATGCTTTTCC TCCTGGGCCA 1080 

AATTATGGAG GAGATGCCCA TTTTGATGAT GATGAAACCT GGACAAGTAG TTCCAAAGGC 1140 

TACAACTTGT TTCTTGTTGC TGCGCATGAG TTCGGCCACT CCTTAGGTCT TGACCACTCC 1200 

AAGGACCCTG GAGCACTCAT GTTTCCTATC TACACCTACA CCGGCAAAAG CCACTTTATG 1260 

CTTCCTGATG ACGATGTACA AGGGATCCAG TCTCTCTATG GTCCAGGAGA TGAAGACCCC 1320 

AACCCTAAAC ATCCAAAAAC GCCAGACAAA TGTGACCCTT CCTTATCCCT TGATGCCATT 1380 

ACCAGTCTCC GAGGAGAAAC AATGATCTTT AAAGACAGAT TCTTCTGGCG CCTGCATCCT 1440 

CAGCAGGTTG ATGCGGAGCT GTTTTTAACG AAATCATTTT GGCCAGAACT TCCCAACCGT 1500 

ATTGATGCTG CATATGAGCA CCCTTCTCAT GACCTCATCT TCATCTTCAG AGGTAGAAAA 1560 

TTTTGGGCTC TTAATGGTTA TGACATTCTG GAAGGTTATC CCAAAAAAAT ATCTGAACTG 1620 

GGTCTTCCAA AAGAAGTTAA GAAGATAAGT GCAGCTGTTC ACTTTGAGGA TACAGGCAAG 1680 
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ACTCTCCTGT TCTCAGGAAA CCAGGTCTGG AGATATGATG ATACTAACCA TATTATGGAT 1740 

AAAGACTATC CGAGACTAAT AGAAGAAGAC TTCCCAGGAA TTGGTGATAA AGTAGATGCT 1800 

GTCTATGAGA AAAATGGTTA TATCTATTTT TTCAACGGAC CCATACAGTT TGAATACAGC 1860 

ATCTGGAGTA ACCGTATTGT TCGCGTCATG CCAGCAAATT CCATTTTGTG GTGTTAAGTG 1920 

TCTTTTTAAA AATTGTTATT TAAATCCTGA AGAGCATTTG GGGTAATACT TCCAGAAGTG 1980 

CGGGGTAGGG GAAGAAGAGC TATCAGGAGA AAGCTCTAGT TCTAGAGGGC CCTATTCTAT 2040 

AGTGTCACCT AAATGCTAGA GGATCTTTGT GAAGGAACCT TACTTCTGTG GTGTGACATA 2100 

ATTGGACAAA CTACCTACAG AGATTTAAAG CTCTAAGGTA AATATAAAAT TTTTAAGTGT 2160 

ATAATGTGTT AAACTACTGA TTCTAATTGT TTGTGTATTT TAGATTCCAA CCTATGGAAC 2220 

TGATGAATGG GAGCAGTGGT GGAATGCCTT TAATGAGGAA AACCTGTTTT GCTCAGAAGA 2280 

AATGCCATCT AGTGATGATG AGGCTACTGC TGACTCTCAA CATTCTACTC CTCCAAAAAA 2340 

GAAGAGAAAG GTAGAAGACC CCAAGGACTT TCCTTCAGAA TTGCTAAGTT TTTTGAGTCA 2400 

TGCTGTGTTT AGTAATAGAA CTCTTGCTTG CTTTGCTATT TACACCACAA AGGAAAAAGC 2460 

TGCACTGCTA TACAAGAAAA TTATGGAAAA ATATTTGATG TATAGTGCCT TGACTAGAGA 2520 

TCATAATCAG CCATACCACA TTTGTAGAGG TTTTACTTGC TTTAAAAAAC CTCCCACACC 2580 

TCCCCCTGAA CCTGAAACAT AAAATGAATG CAATTGTTGT TGTTAACTTG TTTATTGCAG 2640 

CTTATAATGG TTACAAATAA AGCAATAGCA TCACAAATTT CACAAATAAA GCATTTTTTT 2700 

CACTGCATTC TAGTTGTGGT TTGTCCAAAC TCATCAATGT ATCTTATCAT GTCTGGATCA 2760 

TCCCGCCATG GGTATACATC GATGCGGCCG CC 2792 

Q (2) INFORMATION FOR SEQ ID NO : 9 : 



^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5276 base pairs 
M (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
HI (D) TOPOLOGY: linear 

s (ii) MOLECULE TYPE: cDNA 



U (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

f% ii 

^TACCACTA GTAAGCTTAG ATCCACTGTC TGGGATTATA TCAGGACAAC CGAAGCCTGG 60 

S^GTGTATT AGGTAGAGCA TTTTCTTCCA CGTGTTTGGG CACGTTTCCG ACAGCTAGGA 120 

|i?CCAGCTCT GTCTTTGTAT GTTACAGACT GTAAATCAAT CGCAGGTGAA ACTGTTTGGA 180 

S^GTAGGTGG GGATCAAAGA CCCTCCGCCC GTGAGACTCT AGGCGCTTTC CCCTGCCACC 240 

AGCCTGTCTC CAGAGATGCT CTGGAAGGAG GCGGGCCCGG GCGGTCTTTC TGCTCTTTAG 300 

CGTGGCGGAC GCGGCGGCGG GGGCAGGGCT GGAGCAGAGA GCGCTGCAGT GATAGAACTT 360 

TCTGACCCCG CTGCGCAGGG CGGCAGGGTG GCAGGGTGGC AGGGTGGCGA GCTAAGCCAG 420 

AGCCGAACGC TGGAGCTCTG GGAGGAACAT CGAAGTGTTT GTATGTGGTC TGAGATCGGC 480 

CTGACTATAT TTTTTTGTCC TAAATTTGCA AGCACACACC CACAAAGCTG CGGTCTTGAC 540 

CGGTATTCTT TATAGAGCGC AATGGAGTGA GCTGAGTGTC TAAACGATTT CCCTAATTCA 600 

TCTGATAGCA GAGGCGCTCT CCTAATTGGC GAAGAGCTGC CTCATGTCCG CAACTTTTTG 660 

GCAGAGTGAA TTCCACAGCT TTGTGTGTGT GTGTGGGGGG GGGTGTAAGG GGTGTCTAAA 720 

ACTTTCGGTC TCCTACTATT CTGTATCTCG ACCGGTTGGT TTTACACCCC GGCTCATCTC 780 

ATCAACGCAA ACACCCCCAC TCTCCTATGG ACCCAAGGAC CTGACGTGGG GGAAGGTGGA 840 

CATTAGGAAT GTCAGAAACC TAGAGTCCT^C GCTCCTCCTC TCCATCTTTC CACGAGTTTG 900 

GGAAACTTCT TGGCTGCGAA GACTTTGACC CACATCTGCA TTTCTCAGCC CCAGCTTCCA 960 

AAAGTGCTGC AGGTTCGGGA GGGGAGACCT CAGTCCTCCT TTGTGAGGCT TGTTTGCGTT 1020 

GAGGGATTGG CAGCGATGGC TTCCAGATGG GCTGAAACCC TGCCCGTATT TATTTAAACT 1080 

GGTTCCTCGT GGAGAGCTGT GAATCGGGCT CTGTATGCGC TCGAGAAAAG CCCCATTCAT 1140 
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GAGAGGCAAG GCCCAGTGGG TCCCCCCGAC TCCCCGACCC CCCTCTCCCA CAATATATCC 1200 

CCCCTCCCTG TGCCCGCCTG CCGCCACCTC CCGGGCTCCG GCCCCGCGCG CAGCGGCGAC 1260 

GAAGCAACAC AGTTCCCCGA AAGAGGTAGC TTTTTAATTG GCCAGCCACA AAGAATCACT 1320 

TATGCCGCAC GGCGGTAACG AGGGGAACCG GATCGGGCGG CCAGGATGCT ATCTGTGTAG 1380 

CCCTTTTCGT GCCACAATTA GGGTGGTGCT GGCTTCCTCC GACCGCACCT AGGCGATCTG 1440 

GTTACACTGT TGGCTCCTTT CTTGGGCAGT CATTTAATCC TACTTTTTAC TCTACGAATG 1500 

TCTGTCTGAT GGAGGGCTGT GTCCGGAGCC CCATCCACAA AGAGTCAGCC AGCAGCTCTC 1560 

ACACCCGGCT GGATCTCATA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT 1620 

TAAGCCAGCC AAGCTAGCTT GCGCAAGCTA GCTTGCGATC CGTAAAAATG TGTGAGAGTT 1680 

ACAAAATGTC TTCCGGGCTA AGATCCGACA GCCATGGTCC AAAGAAGACT TCGGCACTGC 1740 

AGACTTAAAA CCAGCTTTCT AGCAGAGGCA GAAGGATCTA GAGCCAAAGG CAAAGACTTG 1800 

AATAGGCTGG GAAGATGCAA GAATGGCATT TTACATAAAG AACACTCTCT CCTTTTCCAG 1860 

CCAGCACACT TGCATAGAAA TTAAGTTTTA CACTTGAAGT TCTTTGTTTC CATCCTGAGA 1920 

AGCTCCAAAG TCTGAGGTGG TGTGGTATGC TGGGTAATTC TCCCCACCCC CCAACATTCC 1980 

CTGGGGGTTC CATGGGGGTA GCTTCTCCCA AGGACTTCCA GCGGCAACAC AGAAATCCCA 2040 

CTTCGAGACA AAGGAGTTAC TGCTTAAATC AGGCCCTAAT TTCCAAGGTT CCCTTTGCTT 2100 

AAAGTTCCCT AGAGGACCAT CTCACTTCTA AAGAAAAGGT GTATTCGGGG ACCCATCCTC 2160 

AACCTCCTTG TTATGGAAGG AGACTTCGGG AACAGAGCAA GGGCTGAGCC TCCGGCAGTT 2220 

TGGGGTAAGG TTGGGGTTGG GGGGAGCAAG GAAGGCAAGT GAGGCTGGAG GCCCAGGGAT 2280 

fSGGGGAAGAT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT CTCGGGGATG 2340 

StGGTGGTGG ACAACTAGGA AACTCTGGCG CTTTCTCCTC CCCTCACAAA ACTGAGTCCA 2400 

ScTGGAGCCG CCTCCAGACT CTCTGGCCAG GGCCTCAGAG TGGTCAACAG TCCCTGGCCA 2460 

=(iCGTTGCTCT CTCCAGGCTA AGGGCACCCA CTCCCCTGGA GATTCCTGAA CCTGGGCCAG 2520 

tGAAGAGCCGA ATTAGACAAG TGTCTCCAAT CCGGCTGCGT GCGGATTTTG TTGCGGTGTC 2580 

ICTCGGTTGT CTGCAGTTCC TTTAGTCCCT TCCCTGGCCT GCCCCTTACA CCTCCACACA 2640 

ilGTCCCCCTC TGTGTAGGAA TACACCAGAC CCTCTCTTAG CCACACACAC CTCCAGTCCC 2700 

'iCGTCTACCT AGATTTTTTT CATAGCTAGT TGGATGGGGG ATGGGTTAGG GAGGCTGGGT 2760 

4tGCGAGCCT CCAGGTGGGA GTTCACCGAC AGGTACTCCG CAAAGGAGCT GGAAGGCAGG 2820 

■^TCTCGAAAAC TGTCCCCCAG ATTTAGGATT CTGGGCAGCT TCCATCAGCT TATACTTTGG 2880 

^CTCCCCCGCC CCCTAAACTC CCCATCCCCA CCTTCCTTTC TCCCGTTACT TCGTCCTCCC 2 940 

^iCGCCTTTCC AGCCTTGAGT CTAAAGCTCC ATGCTTATGC CTCTGCAAAC AACCCCCTCC 3 000 

^€TTCTAACCC CAGCAGAACT CCGAGGAAAG GGGCCGGAGG CCCCCCTTCT CGCCTGTGGT 3060 

QaGAGGGGGC AGTGTGGCAG TCCCAAGTGG GGGCGACCGG AGGCCGTCTC GGTGCCCCGC 3120 

BCGATCAGGC CACTGGGCAC ATCGGGGGCG GGAAGCTGGG CTCACCAAAG GGGCGACTGG 3180 

©CTTGGCAGG TGTGGGCTCT GGTCCGGCCT GGGCAGGCTC CGGGGGCGGG GTCTCAGGTT 3240 

ACAGCCCCGC GGGGGGCTGG GGGGCGGCCC GCGGTTTGGG CTGGTTTGCC AGCCTTTGGA 33 00 

GCGACCGGGA GCATATAACC GGAGCCTCTG CTGGGAGAAG ACGCAGAGCG CCGCTGGGCT 3360 

GCCGGGTCTC CTGCCTCCTC CTCCTGCTCC TAGAGCCTCC TGCATGAGGG CGCGGTAGAG 3420 

ACCCGGACCC GCTCCGTGCT CTGCCGCCTC GCCGAGCTTC GCCCGCAAGC TGGGGAATTC 3480 

ATATGTCTAG ATTAGATAAA AGTAAAGTGA TTAACAGCGC ATTAGAGCTG CTTAATGAGG 3540 

TCGGAATCGA AGGTTTAACA ACCCGTAAAC TCGCCCAGAA GCTAGGTGTA GAGCAGCCTA 3 600 

CATTGTATTG GCATGTAAAA AATAAGCGGG CTTTGCTCGA CGCCTTAGCC ATTGAGATGT 3 660 

TAGATAGGCA CCATACTCAC TTTTGCCCTT TAGAAGGGGA AAGCTGGCAA GATTTTTTAC 3720 

GTAATAACGC TAAAAGTTTT AGATGTGCTT TACTAAGTCA TCGCGATGGA GCAAAAGTAC 3780 

ATTTAGGTAC ACGGCCTACA GAAAAACAGT ATGAAACTCT CGAAAATCAA TTAGCCTTTT 3 840 

TATGCCAACA AGGTTTTTCA CTAGAGAATG CATTATATGC ACTCAGCGCT GTGGGGCATT 3 900 

TTACTTTAGG TTGCGTATTG GAAGATCAAG AGCATCAAGT CGCTAAAGAA GAAAGGGAAA 3960 

CACCTACTAC TGATAGTATG CCGCCATTAT TACGACAAGC TATCGAATTA TTTGATCACC 4020 

AAGGTGCAGA GCCAGCCTTC TTATTCGGCC TTGAATTGAT CATATGCGGA TTAGAAAAAC 4080 

AACTTAAATG TGAAAGTGGG TCCGCGTACA GCCGCGCGCG TACGAAAAAC AATTACGGGT 4140 

CTACCATCGA GGGCCTGCTC GATCTCCCGG ACGACGACGC CCCCGAAGAG GGGGGGCTGG 4200 
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CGGCTCCGCG CCTGTCCTTT CTCCCCGCGG GACACACGCG CAGACTGTCG ACGGCCCCCC 4260 

CGACCGATGT CAGCCTGGGG GACGAGCTCC ACTTAGACGG CGAGGACGTG GCGATGGCGC 4320 

ATGCCGACGC GCTAGACGAT TTCGATCTGG ACATGTTGGG GGACGGGGAT TCCCCGGGTC 4380 

CGGGATTTAC CCCCCACGAC TCCGCCCCCT ACGGCGCTCT GGATATGGCC GACTTCGAGT 4440 

TTGAGCAGAT GTTTACCGAT GCCCTTGGAA TTGACGAGTA CGGTGGGTAG GGGGCGCGAG 4500 

GATCCTCTAG AGGGCCCTAT TCTATAGTGT CACCTAAATG CTAGAGGATC TTTGTGAAGG 4560 

AACCTTACTT CTGTGGTGTG ACATAATTGG ACAAACTACC TACAGAGATT TAAAGCTCTA 4620 

AGGTAAATAT AAAATTTTTA AGTGTATAAT GTGTTAAACT ACTGATTCTA ATTGTTTGTG 4680 

TATTTTAGAT TCCAACCTAT GGAACTGATG AATGGGAGCA GTGGTGGAAT GCCTTTAATG 4740 

AGGAAAACCT GTTTTGCTCA GAAGAAATGC CATCTAGTGA TGATGAGGCT ACTGCTGACT 4800 

CTCAACATTC TACTCCTCCA AAAAAGAAGA GAAAGGTAGA AGACCCCAAG GACTTTCCTT 4860 

CAGAATTGCT AAGTTTTTTG AGTCATGCTG TGTTTAGTAA TAGAACTCTT GCTTGCTTTG 4920 

CTATTTACAC CACAAAGGAA AAAGCTGCAC TGCTATACAA GAAAATTATG GAAAAATATT 4980 

TGATGTATAG TGCCTTGACT AGAGATCATA ATCAGCCATA CCACATTTGT AGAGGTTTTA 5040 

CTTGCTTTAA AAAACCTCCC ACACCTCCCC CTGAACCTGA AACATAAAAT GAATGCAATT 5100 

GTTGTTGTTA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA 5160 

AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC 5220 

AATGTATCTT ATCATGTCTG GATCATCCCG CCATGGGTAT ACATCGATGC GGCCGC 5276 

(2) INFORMATION FOR SEQ ID NO: 10: 

% (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7664 base pairs 
i2 (B) TYPE: nucleic acid 

r. (C) STRANDEDNESS : single 

™ (D) TOPOLOGY: linear 

2 (ii) MOLECULE TYPE: cDNA 

' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

feTACCACTA GTAAGCTTAG 
MI^GTGTATT AGGTAGAGCA 
^CCAGCTCT GTCTTTGTAT 
□^GTAGGTGG GGATCAAAGA 
gGCCTGTCTC CAGAGATGCT 
CGTGGCGGAC GCGGCGGCGG 
TCTGACCCCG CTGCGCAGGG 
AGCCGAACGC TGGAGCTCTG 
CTGACTATAT TTTTTTGTCC 
CGGTATTCTT TATAGAGCGC 
TCTGATAGCA GAGGCGCTCT 
GCAGAGTGAA TTCCACAGCT 
ACTTTCGGTC TCCTACTATT 
ATCAACGCAA ACACCCCCAC 
CATTAGGAAT GTCAGAAACC 
GGAAACTTCT TGGCTGCGAA 
AAAGTGCTGC AGGTTCGGGA 
GAGGGATTGG CAGCGATGGC 
GGTTCCTCGT GGAGAGCTGT 
GAGAGGCAAG GCCCAGTGGG 



ATCCACTGTC TGGGATTATA TCAGGACAAC CGAAGCCTGG 60 

TTTTCTTCCA CGTGTTTGGG CACGTTTCCG ACAGCTAGGA 120 

GTTACAGACT GTAAATCAAT CGCAGGTGAA ACTGTTTGGA 180 

CCCTCCGCCC GTGAGACTCT AGGCGCTTTC CCCTGCCACC 240 

CTGGAAGGAG GCGGGCCCGG GCGGTCTTTC TGCTCTTTAG 300 

GGGCAGGGCT GGAGCAGAGA GCGCTGCAGT GATAGAACTT 360 

CGGCAGGGTG GCAGGGTGGC AGGGTGGCGA GCTAAGCCAG 42 0 

GGAGGAACAT CGAAGTGTTT GTATGTGGTC TGAGATCGGC 480 

TAAATTTGCA AGCACACACC CACAAAGCTG CGGTCTTGAC 540 

AATGGAGTGA GCTGAGTGTC TAAACGATTT CCCTAATTCA 600 

CCTAATTGGC GAAGAGCTGC CTCATGTCCG CAACTTTTTG 660 

TTGTGTGTGT GTGTGGGGGG GGGTGTAAGG GGTGTCTAAA 720 

CTGTATCTCG ACCGGTTGGT TTTACACCCC GGCTCATCTC 780 

TCTCCTATGG ACCCAAGGAC CTGACGTGGG GGAAGGTGGA 840 

TAGAGTCCAC GCTCCTCCTC TCCATCTTTC CACGAGTTTG 900 

GACTTTGACC CACATCTGCA TTTCTCAGCC CCAGCTTCCA 960 

GGGGAGACCT CAGTCCTCCT TTGTGAGGCT TGTTTGCGTT 1020 

TTCCAGATGG GCTGAAACCC TGCCCGTATT TATTTAAACT 1080 

GAATCGGGCT CTGTATGCGC TCGAGAAAAG CCCCATTCAT 1140 

TCCCCCCGAC TCCCCGACCC CCCTCTCCCA CAATATATCC 12 00 
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CCCCTCCCTG TGCCCGCCTG CCGCCACCTC CCGGGCTCCG GCCCCGCGCG CAGCGGCGAC 1260 

GAAGCAACAC AGTTCCCCGA AAGAGGTAGC TTTTTAATTG GCCAGCCACA AAGAATCACT 132 0 

TATGCCGCAC GGCGGTAACG AGGGGAACCG GATCGGGCGG CCAGGATGCT ATCTGTGTAG 1380 

CCCTTTTCGT GCCACAATTA GGGTGGTGCT GGCTTCCTCC GACCGCACCT AGGCGATCTG 144 0 

GTTACACTGT TGGCTCCTTT CTTGGGCAGT CATTTAATCC TACTTTTTAC TCTACGAATG 1500 

TCTGTCTGAT GGAGGGCTGT GTCCGGAGCC CCATCCACAA AGAGTCAGCC AGCAGCTCTC 1560 

ACACCCGGCT GGATCTCATA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT 1620 

TAAGCCAGCC AAGCTAGCTT GCGCAAGCTA GCTTGCGATC CGTAAAAATG TGTGAGAGTT 1680 

ACAAAATGTC TTCCGGGCTA AGATCCGACA GCCATGGTCC AAAGAAGACT TCGGCACTGC 1740 

AGACTTAAAA CCAGCTTTCT AGCAGAGGCA GAAGGATCTA GAGCCAAAGG CAAAGACTTG 1800 

AATAGGCTGG GAAGATGCAA GAATGGCATT TTACATAAAG AACACTCTCT CCTTTTCCAG 1860 

CCAGCACACT TGCATAGAAA TTAAGTTTTA CACTTGAAGT TCTTTGTTTC CATCCTGAGA 1920 

AGCTCCAAAG TCTGAGGTGG TGTGGTATGC TGGGTAATTC TCCCCACCCC CCAACATTCC 1980 

CTGGGGGTTC CATGGGGGTA GCTTCTCCCA AGGACTTCCA GCGGCAACAC AGAAATCCCA 2040 

CTTCGAGACA AAGGAGTTAC TGCTTAAATC AGGCCCTAAT TTCCAAGGTT CCCTTTGCTT 2100 

AAAGTTCCCT AGAGGACCAT CTCACTTCTA AAGAAAAGGT GTATTCGGGG ACCCATCCTC 2160 

AACCTCCTTG TTATGGAAGG AGACTTCGGG AACAGAGCAA GGGCTGAGCC TCCGGCAGTT 2220 

TGGGGTAAGG TTGGGGTTGG GGGGAGCAAG GAAGGCAAGT GAGGCTGGAG GCCCAGGGAT 2280 

AGGGGAAGAT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT CTCGGGGATG 2340 

GTGGTGGTGG ACAACTAGGA AACTCTGGCG CTTTCTCCTC CCCTCACAAA ACTGAGTCCA 24 00 

©CTGGAGCCG CCTCCAGACT CTCTGGCCAG GGCCTCAGAG TGGTCAACAG TCCCTGGCCA 2460 

^GCGTTGCTCT CTCCAGGCTA AGGGCACCCA CTCCCCTGGA GATTCCTGAA CCTGGGCCAG 2520 

mAGAGCCGA ATTAGACAAG TGTCTCCAAT CCGGCTGCGT GCGGATTTTG TTGCGGTGTC 2580 

liCTCGGTTGT CTGCAGTTCC TTTAGTCCCT TCCCTGGCCT GCCCCTTACA CCTCCACACA 2640 

'gGTCCCCCTC TGTGTAGGAA TACACCAGAC CCTCTCTTAG CCACACACAC CTCCAGTCCC 2700 

eCGTCTACCT AGATTTTTTT CATAGCTAGT TGGATGGGGG ATGGGTTAGG GAGGCTGGGT 2760 

ItGCGAGCCT CCAGGTGGGA GTTCACCGAC AGGTACTCCG CAAAGGAGCT GGAAGGCAGG 282 0 

ICTGGAAAAC TGTCCCCCAG ATTTAGGATT CTGGGCAGCT TCCATCAGCT TATACTTTGG 2880 

''dTCCCCCGCC CCCTAAACTC CCCATCCCCA CCTTCCTTTC TCCCGTTACT TCGTCCTCCC 2940 

^-CGCCTTTCC AGCCTTGAGT CTAAAGCTCC ATGCTTATGC CTCTGCAAAC AACCCCCTCC 3000 

feTCTAACCC CAGCAGAACT CCGAGGAAAG GGGCCGGAGG CCCCCCTTCT CGCCTGTGGT 3060 

^AGAGGGGGC AGTGTGGCAG TCCCAAGTGG GGGCGACCGG AGGCCGTCTC GGTGCCCCGC 3120 

^CGATCAGGC CACTGGGCAC ATCGGGGGCG GGAAGCTGGG CTCACCAAAG GGGCGACTGG 3180 

iCTTGGCAGG TGTGGGCTCT GGTCCGGCCT GGGCAGGCTC CGGGGGCGGG GTCTCAGGTT 3240 

^CAGCCCCGC GGGGGGCTGG GGGGCGGCCC GCGGTTTGGG CTGGTTTGCC AGCCTTTGGA 33 00 

SCGACCGGGA GCATATAACC GGAGCCTCTG CTGGGAGAAG ACGCAGAGCG CCGCTGGGCT 3360 

GCCGGGTCTC CTGCCTCCTC CTCCTGCTCC TAGAGCCTCC TGCATGAGGG CGCGGTAGAG 3420 

ACCCGGACCC GCTCCGTGCT CTGCCGCCTC GCCGAGCTTC GCCCGCAAGC TGGGGAATTC 3480 

GGATCCCCGG GATCGAAAGA GCCTGCTAAA GCAAAAAAGA AGTCACCATG TCGTTTACTT 3540 

TGACCAACAA GAACGTGATT TTCGTTGCCG GTCTGGGAGG CATTGGTCTG GACACCAGCA 3600 

AGGAGCTGCT CAAGCGCGAT CCCGTCGTTT TACAACGTCG TGACTGGGAA AACCCTGGCG 3660 

TTACCCAACT TAATCGCCTT GCAGCACATC CCCCTTTCGC CAGCTGGCTT TATAGCGAAG 3720 

AGGCCCGCAC CGATCGCCCT TCCCAACAGT TGCGCAGCCT GAATGGCGAA TGGCGCTTTG 3780 

CCTGGTTTCC GGCACCAGAA GCGGTGCCGG AAAGCTGGCT GGAGTGCGAT CTTCCTGAGG 3840 

CCGATACTGT CGTCGTCCCC TCAAACTGGC AGATGCACGG TTACGATGCG CCCATCTACA 3900 

CCAACGTAAC CTATTCCATT ACGGTCAATC CGCCGTTTGT TCCCACGGAG AATCCGACGG 3960 

GTTGTTACTC GCTCACATTT AATGTTGATG AAAGCTGGCT ACAGGAAGGC CAGACGCGAA 4020 

TTATTTTTGA TGGCGTTAAC TTGGCGTTTC ATCTGTGGTG CAACGTGCGC TGGGTCGGTT 4080 

ACGGCCAGGA CAGTCGTTTG CCGTCTGAAT TTGACCTGAG CGCATTTTTA CGCGCCGGAG 4140 

AAAACCGCCT CGCGGTGATG GTGCTGCGTT GGAGTGACGG CAGTTATCTG GAAGATCAGG 4200 

ATATGTGGCG GATGAGCGGC ATTTTCCGTG ACGTCTCGTT GCTGCATAAA CCGACTACAC 4260 
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A7ATCAGCGA TTTCCATGTT GCCACTCGCT TTAATGATGA TTTCAGCCGC GCTGAACTGG 4320 

AGGCTGAAGT TCAGATGTGC GGCGAGTTGC GTGACTACCT ACGGGTAACA GTTTCTTTAT 4380 

GGCAGGGTGA AACGCAGGTC GCCAGCGGCA CCGCGCCTTT CGGCGGTGAA ATTATCGATG 4440 

AGCGTGGTGG TTATGCCGAT CGCGTCACAC TACGTCTGAA CGTCGAAAAC CCGAAACTGT 4500 

GGAGCGCCGA AATCCCGAAT CTCTATCGTG CGGTGGTTGA ACTGCACACC GCCGACGGCA 4560 

CGCTGATTGA AGCAGAAGCC TGCGATGTCG GTTTCCGCGA GGTGCGGATT GAAAATGGTC 4620 

TGCTGCTGCT GAACGGCAAG CCGTTGCTGA TTCGAGGCGT TAACCGTCAC GAGCATCATC 4680 

CTCTGCATGG TCAGGTCATG GATGAGCAGA CGATGGTGCA GGATATCCTG CTGATGAAGC 4740 

AGAACAACTT TAACGCCGTG CGCTGTTCGC ATTATCCGAA CCATCCGCTG TGGTACACGC 4800 

TGTGCGACCG CTACGGCCTG TATGTGGTGG ATGAAGCCAA TATTGAAACC CACGGCATGG 4860 

TGCCAATG;^ TCTGCTGACC GATGATCCGC GCTGGCTACC GGCGATGAGC GAACGCGTAA 4920 

CGCGAATGGT GCAGCGCGAT CGTAATCACC CGAGTGTGAT CATCTGGTCG CTGGGGAATG 4980 

AATCAGGCCA CGGCGCTAAT CACGACGCGC TGTATCGCTG GATCAAATCT GTCGATCCTT 5040 

CCCGCCCGGT GCAGTATGAA GGCGGCGGAG CCGACACCAC GGCCACCGAT ATTATTTGCC 5100 

CGATGTACGC GCGCGTGGAT GAAGACCAGC CCTTCCCGGC TGTGCCGAAA TGGTCCATCA 5160 

AAAAATGGCT TTCGCTACCT GGAGAGACGC GCCCGCTGAT CCTTTGCGAA TACGCCCACG 5220 

CGATGGGTAA CAGTCTTGGC GGTTTCGCTA AATACTGGCA GGCGTTTCGT CAGTATCCCC 5280 

GTTTACAGGG CGGCTTCGTC TGGGACTGGG TGGATCAGTC GCTGATTAAA TATGATGAAA 5340 

ACGGCAACCC GTGGTCGGCT TACGGCGGTG ATTTTGGCGA TACGCCGAAC CATCGCCAGT 5400 

JIjCTGTATGAA CGGTCTGGTC TTTGCCGACC GCACGCCGCA TCCAGCGCTG ACGGAAGCAA 5460 

AACACCAGCA GCAGTTTTTC CAGTTCCGTT TATCCGGGCA AACCATCGAA GTGACCAGCG 5520 

^TACCTGTT CCGTCATAGC GATAACGAGC TCCTGCACTG GATGGTGGCG CTGGATGGTA 5580 

MCCGCTGGC AAGCGGTGAA GTGCCTCTGG ATGTCGCTCC ACAAGGTAAA CAGTTGATTG 564 0 

MVCTGCCTGA ACTACCGCAG CCGGAGAGCG CCGGGCAACT CTGGCTCACA GTACGCGTAG 5700 

itCAACCGAA CGCGACCGGA TGGTCAGAAG CCGGGCACAT CAGCGCCTGG CAGCAGTGGC 5760 

#CTGGCGGA AAACCTCAGT GTGACGCTCC CCGCCGCGTC CCACGCCATC CCGCATCTGA 5820 

UfeACCAGCGA AATGGATTTT TGCATCGAGC TGGGTAATAA GCGTTGGCAA TTTAACCGCC 5880 

S3TCAGGCTT TCTTTCACAG CTGTGGATTG GCGATAAAAA ACAACTGCTG ACGCCGCTGC 5940 

GCGATCAGTT CACCCGTGCA CCGCTGGATA ACGACATTGG CGTAAGTGAA GCGACCCGCA 6000 

lEI-GACCCTAA CGCCTGGGTC GAACGCTGGA AGGCGGCGGG CCATTACCAG GCCGAAGCAG 6060 

GGTTGTTGCA GTGCACGGCA GATACACTTG CTGATGCGGT GCTGATTACG ACCGCTCACG 6120 

PGTGGCAGCA TCAGGGGAAA ACCTTATTTA TCAGCCGGAA AACCTACCGG ATTGATGGTA 6180 

^GGTCAAAT GGCGATTACC GTTGATGTTG AAGTGGCGAG CGATACACCG CATCCGGCGC 624 0 

toTTGGCCT GAACTGCCAG CTGGCGCAGG TAGCAGAGCG GGTAAACTGG CTCGGATTAG 6300 

gsCCGCAAGA AAACTATCCC GACCGCCTTA CTGCCGCCTG TTTTGACCGC TGGGATCTGC 6360 

dVTTGTCAGA CATGTATACC CCGTACGTCT TCCCGAGCGA AAACGGTCTG CGCTGCGGGA 6420 

CGCGCGAATT GAATTATGGC CCACACCAGT GGCGCGGCGA CTTCCAGTTC AACATCAGCC 6480 

GCTACAGTCA ACAGCAACTG ATGGAAACCA GCCATCGCCA TCTGCTGCAC GCGGAAGAAG 6540 

GCACATGGCT GAATATCGAC GGTTTCCATA TGGGGATTGG TGGCGACGAC TCCTGGAGCC 6600 

CGTCAGTATC GGCGGAATTA CAGCTGAGCG CCGGTCGCTA CCATTACCAG TTGGTCTGGT 6660 

GTCAAAAATA ATAATAACCG GCAGGCCATG TCTGAAAGTA TTCGCGTAAG GAAATCCATT 6720 

ATGTACTATT TAAAAAACAC AAACTTTTGG ATGTTCGGTT TATTCTTTTT CTTTTACTTT 6780 

TTTATCATGG GAGCCTACTT CCCGTTTTTC CCGATTTGGC TACATGACAT CAACCATATG 684 0 

AGCAAAAGTG ATACGGGTAT TATTTTTGCC GCTATTTCTC TGTTGTCGCT ATTATTCCAA 6900 

CCGCTGTTGG TCTGCTTTCT GACAAACTCG GCCTCGACTC TAGACTGAGA ACTTCAGGGT 6960 

GAGTTTGGGG ACCCTTGATT GTTCTTTCTT TTTCGCTATT GAAAAATTCA TGTTATATGG 7020 

AGGGGGCAAA GTTTTCAGGG TGTTGTTTAG AATGGGAAGA TGTCCCTTGT ATCACCATGG 7080 

ACCCTCATGA TAATTTTGTT TCTTTCACTT TCTACTCTGT TGACAACCAT TGTCTCCTCT 7140 

TATTTTCTTT TCATTTTCTG TAACTTTTTT CGTTAAACTT TAGCTTGCAT TTGTAACGAA 7200 

TTTTTAAATT CACTTTCGTT TATTTGTCAG ATTGTAAGTA CTTTCTCTAA TCACTTTTTT 7260 

TTCAAGGCAA TCAGGGTAAT TATATTGTAC TTCAGCACAG TTTTAGAGAA CAATTGTTAT 7320 
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AATTAAATGA 
TATTGGTAGA 
ATATACACTG 
CCATGTTCAT 
GTCTCATCAT 
GGCTGGTGTG 



TAAGGTAGAA 
AACAACTACA 
TTTGAGATGA 
GCCTTCTTCT 
TTTGGCAAAG 
GCCAATGCCC 



TATTTCTGCA 
TCCTGGTAAT 
GGATAAAATA 
TTTTCCTACA 
AATTCACTCC 
TGGCTCACAA 



TATAAATTCT 
CATCCTGCCT 
CTCTGAGTCC 
GCTCCTGGGC 
TCAGGTGCAG 
ATACCACTGA 



GGCTGGCGTG 
TTCTCTTTAT 
AAACCGGGCC 
AACGTGCTGG 
GCTGCCTATC 
GATC 



GAAATATTCT 
GGTTACAATG 
CCTCTGCTAA 
TTGTTGTGCT 
AGAAGGTGGT 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



7380 
7440 
7500 
7560 
7620 
7664 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CGAGGGCCTG CTCGATCTCC 

□ (2) INFORMATION FOR SEQ ID NO: 12: 

M (i) SEQUENCE CHARACTERISTICS: 

y' (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
J (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

f_ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12; 

Jgcattccac cactgctccc 

□ (2) INFORMATION FOR SEQ ID NO: 13: 

6 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GAGCACCCTT CTCATGACCT C 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 



20 



20 



21 
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(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GTTGGTGTAG ATGGGCGCAT CG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
IgPGGGGTCTC AGGTTACAGC C 

...a (2) INFORMATION FOR SEQ ID NO: 16: 

Cii (i) SEQUENCE CHARACTERISTICS: 
P (A) LENGTH: 23 base pairs 

J^^ (B) TYPE: nucleic acid 

£ (C) STRANDEDNESS: single 

^ (D) TOPOLOGY: linear 



M (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

ru 

S:CCTCTGGC CTGCTGGCTC ATG 

O (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
CAGGAGAGTC TTGCCTGTAT CCTC 



(2) INFORMATION FOR SEQ ID NO: 18: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CAAGATGCAT CCAGGGGTCC TGGCTGCCTT CCTCTTCTTG AGCTGGACTC ATTGTCGGGC 60 

CCTGCCCCTT CCCAGTGGTG GTGATGAAGA TGATTTGTCT GAGGAAGACC TCCAGTTTGC 120 

AGAGCGCTAC CTGAGATCAT ACTACCATCC TACAAATCTC GCGGGAATCC TGAAGGAGAA 180 

TGCAGCAAGC TCCATGACTG AGAGGCTCCG AGAAATGCAG TCTTTCTTCG GCTTAGAGGT 240 

GACTGGCAAA CTTGACGATA ACACCTTAGA TGTCATGAAA AAGCCAAGAT GCGGGGTTGT 3 00 

CGATGTGGG TGAATACAATG TTTTCCCTCG AACTCTTAAA TGGTCCAAAA TGAATTTAAC 360 

CTACAGAATT GTGAATTACA CCCCTGATAT GACTCATTCT GAAGTCGAAA AGGCATTCAA 420 

AAAAGCCTTC AAAGTTTGGT CCGATGTAAC TCCTCTGAAT TTTACCAGAC TTCACGATGG 480 

CATTGCTGAC ATCATGATCT CTTTTGGAAT TAAGGAGCAT GGCGACTTCT ACCCATTTGA 54 0 

CIgGGCCCTCT GGCCTGCTGG CTCATGCTTT TCCTCCTGGG CCAAATTATG GAGGAGATGC 600 

'tCATTTTGAT GATGATGAAA CCTGGACAAG TAGTTCCAAA GGCTACAACT TGTTTCTTGT 660 

'^GCTGCGCAT GAGTTCGGCC ACTCCTTAGG TCTTGACCAC TCCAAGGACC CTGGAGCACT 72 0 

!=€ATGTTTCCT ATCTACACCT ACACCGGCAA AAGCCACTTT ATGCTTCCTG ATGACGATGT 780 

'^ACAAGGGATC CAGTCTCTCT ATGGTCCAGG AGATGAAGAC CCCAACCCTA AACATCCAAA 840 

mACGCCAGAC AAATGTGACC CTTCCTTATC CCTTGATGCC ATTACCAGTC TCCGAGGAGA 900 

ikACAATGATC TTTAAAGACA GATTCTTCTG GCGCCTGCAT CCTCAGCAGG TTGATGCGGA 960 

FgCTGTTTTTA ACGAAATCAT TTTGGCCAGA ACTTCCCAAC CGTATTGATG CTGCATATGA 102 0 

GCACCCTTCT CATGACCTCA TCTTCATCTT CAGAGGTAGA AAATTTTGGG CTCTTAATGG 1080 

'tTATGACATT CTGGAAGGTT ATCCCAAAAA AATATCTGAA CTGGGTCTTC CAAAAGAAGT 1140 

^TAAGAAGATA AGTGCAGCTG TTCACTTTGA GGATACAGGC AAGACTCTCC TGTTCTCAGG 1200 

ilAACCAGGTC TGGAGATATG ATGATACTAA CCATATTATG GATAAAGACT ATCCGAGACT 1260 

IIaTAGAAGAA GACTTCCCAG GAATTGGTGA TAAAGTAGAT GCTGTCTATG AGAAAAATGG 1320 

'^TATATCTAT TTTTTCAACG GACCCATACA GTTTGAATAC AGCATCTGGA GTAACCGTAT 1380 

3GTTCGCGTC ATGCCAGCAA ATTCCATTTT GTGGTGTTAA GTGTCTTTTT AAAAATTGTT 1440 

^=1tTTAAATCC TGAAGAGCAT TTGGGGTAAT ACTTCCAGAA GTGCGGGGTA GGGGAAGAAG 1500 

AGCTATCAGG AGAAAGCTTG G ^^^^ 

(2) INFORMATION FOR SEQ ID NO: 19 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
Pro Arg Cys Gly Xaa Pro Asp 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
His Glu Xaa Gly His Xaa Xaa Xaa Xaa Xaa His Ser 
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(2) INFORMATION FOR SEQ ID NO: 21: 

O (i) SEQUENCE CHARACTERISTICS: 

m (A) LENGTH: 471 amino acids 



(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



flyiet His 


Pro 


Gly 


Val 


Leu 


Ala 


Ala 


Phe 


Leu 


Phe 


Leu 


Ser 


Trp 


Thr 


His 






5 










10 










15 




5J:ys Arg 


Ala 


Leu 


Pro 


Leu 


Pro 


Ser 


Gly 


Gly 


Asp 


Glu Asp 


Asp 


Leu 


Ser 






20 










25 










30 




His 


QjIu Glu 

Qi 


Asp 


Leu 


Gin 


Phe 


Ala 


Glu 


Arg 


Tyr 


Leu 


Arg 


Ser 


Tyr 


Tyr 


35 










40 










45 








Pro Thr 


Asn 


Leu 


Ala 


Gly 


He 


Leu 


Lys 


Glu 


Asn 


Ala 


Ala 


Ser 


Ser 


Met 


50 








55 










60 










Thr Glu 


Arg 


Leu 


Arg 


Glu 


Met 


Gin 


Ser 


Phe 


Phe 


Gly 


Leu 


Glu 


Val 


Thr 


65 




70 










75 










80 


Gly Lys 


Leu 


Asp 


Asp 


Asn 


Thr 


Leu 


Asp 


Val 


Met 


Lys 


Lys 


Pro 


Arg 


Cys 






85 










90 










95 




Gly Gly 


Val 


Asp 


Val 


Gly 


Glu 


Tyr 


Asn 


Val 


Phe 


Pro 


Arg 


Thr 


Leu 


Lys 




100 










105 










110 






Trp Ser 


Lys 


Met 


Asn 


Leu 


Thr 


Tyr 


Arg 


He 


Val 


Asn 


Tyr 


Thr 


Pro 


Asp 


115 










120 










125 






Val 


Met Thr 


His 


Ser 


Glu 


Val 


Glu 


Lys 


Ala 


Phe 


Lys 


Lys 


Ala 


Phe 


Lys 


130 










135 










140 








He 


Trp Ser 


Asp 


Val 


Thr 


Pro 


Leu 


Asn 


Phe 


Thr 


Arg 


Leu 


His 


Asp 


Gly 


145 






150 










155 










160 


Ala Asp 


He 


Met 


He 


Ser 


Phe 


Gly 


He 


Lys 


Glu 


His 


Gly 


Asp 


Phe 


Tyr 



15 











165 




Pro 


Phe Asp 


Gly 


Pro Ser 


Gly 








180 






Pro 


Asn 


Tyr 
195 


Gly 


Gly Asp 


Ala 


Ser 


Ser 
210 


Ser 


Lys 


Gly Tyr 


Asn 
215 


Gly His 


Ser 


Leu 


Gly Leu 


Asp 


225 








230 




Phe 


Pro 


He 


Tyr 


Thr Tyr 
245 


Thr 


Asp Asp 


Val 


Gin 


Gly He 


Gin 








260 






Pro 


Asn 


Pro 
275 


Lys 


His Pro 


Lys 


Ser 


Leu Asp 


Ala 


He Thr 


Ser 




290 








295 


Asp Arg 


Phe 


Phe 


Trp Arg 


Leu 


305 








310 




DPhe 


Leu 


Thr 


Lys 


Ser Phe 


Trp 










325 




QAla 


Tyr 


Glu 


His 


Pro Ser 


His 






340 








Phe 


Trp 


Ala 


Leu Asn 


Gly 






355 








He 


Ser 


Glu 


Leu Gly 


Leu 


370 








375 




Val 


His 


Phe 


Glu Asp 


Thr 


L385 








390 




riGln 


Val 


Trp 


Arg 


Tyr Asp 


Asp 








405 






Arg 


Leu 


He 


Glu Glu 


Asp 


Pi 






420 






^==n^^Ala 


Val 


Tyr 


Glu 


Lys Asn 


Gly 


n 




435 








Gin 


Phe 
450 


Glu 


Tyr 


Ser He 


Trp 
455 


Ala 


Asn 


Ser 


He 


Leu Trp 


Cys 


465 








470 







170 








175 




Leu Leu 


Ala 


His 


Ala Phe 


Pro 


Pro 


Gly 


185 








190 






His Phe 


Asp 


Asp 


Asp Glu 


Thr 


Trp 


Thr 


200 






205 








Leu Phe 


Leu 


Val 


Ala Ala 


His 


Glu 


Phe 








220 








His Ser 


Lys 


Asp 


Pro Gly 


Ala 


Leu 


Met 




235 








240 


Gly Lys 


Ser 


His 


Phe Met 


Leu 


Pro 


Asp 


.250 








255 




Ser Leu Tyr Gly Pro Gly 


Asp 


Glu Asp 


265 








270 






Thr Pro Asp 


Lys 


Cys Asp 


Pro 


Ser 


Leu 


280 






285 








Leu Arg Gly Glu 


Thr Met 


He 


Phe 


Lys 








300 








His Pro 


Gin 


Gin 


Val Asp 


Ala 


Glu 


Leu 






315 








320 


Pro Glu 


Leu 


Pro Asn Arg 


He 


Asp 


Ala 




330 








335 




Asp Leu 


He 


Phe 


He Phe 


Arg Gly Arg 


345 








350 






Tyr Asp 


He 


Leu Glu Gly 


Tyr 


Pro 


Lys 


360 






365 








Pro Lys 


Glu 


Val 


Lys Lys 


He 


Ser 


Ala 






380 








Gly Lys 


Thr 


Leu 


Leu Phe 


Ser 


Gly Asn 




395 








400 


Thr Asn 


His 


He 


Met Asp 


Lys 


Asp 


Tyr 




410 








415 




Phe Pro 


Gly 


He 


Gly Asp 


Lys 


Val 


Asp 


425 








430 






Tyr He 


Tyr 


Phe 


Phe Asn 


Gly 


Pro 


He 


440 






445 








Ser Asn Arg 


He 


Val Arg 


Val 


Met 


Pro 



460 
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